Scaffold Topologies. 2. Analysis of Chemical Databases
Abstract
We have systematically enumerated graph representations of scaffold topologies for up to eight-ring molecules and four-valence atoms, thus providing coverage of the lower portion of the chemical space of small molecules (Pollock et al. J. Chem. Inf. Model., this issue). Here, we examine scaffold topology distributions for several databases: ChemNavigator and PubChem for commercially available chemicals, the Dictionary of Natural Products, a set of 2742 launched drugs, WOMBAT, a database of medicinal chemistry compounds, and two subsets of PubChem, "actives" and DSSTox comprising toxic substances. We also examined a virtual database of exhaustively enumerated small organic molecules, GDB (Fink et al. Angew. Chem., Int. Ed. 2005, 44, 1504-1508), and we contrast the scaffold topology distribution from these collections to the complete coverage of up to eight-ring molecules. For reasons related, perhaps, to synthetic accessibility and complexity, scaffolds exhibiting six rings or more are poorly represented. Among all collections examined, PubChem has the greatest scaffold topological diversity, whereas GDB is the most limited. More than 50% of all entries (13 000 000+ actual and 13 000 000+ virtual compounds) exhibit only eight distinct topologies, one of which is the nonscaffold topology that represents all treelike structures. However, most of the topologies are represented by a single or very small number of examples. Within topologies, we found that three-way scaffold connections (3-nodes) are much more frequent compared to four-way (4-node) connections. Fused rings have a slightly higher frequency in biologically oriented databases. Scaffold topologies can be the first step toward an efficient coarse-grained classification scheme of the molecules found in chemical databases.
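To make the terms in the abstract concrete, the following is a minimal sketch of how a molecular graph might be reduced to a scaffold topology, so that 3-nodes and 4-nodes can be counted. It is an illustration under stated assumptions, not the authors' implementation: it assumes a connected molecule given as a list of atom-index pairs, assumes that side chains are removed by repeatedly pruning degree-1 atoms, and assumes that degree-2 atoms are then contracted, with loops and parallel edges allowed. The function names `scaffold_topology` and `node_degree_counts` are hypothetical. The authoritative definitions are those in Part 1 of the series.

```python
from collections import Counter


def degrees(edges):
    """Degree of every node in a multigraph; a self-loop counts twice."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


def scaffold_topology(edges):
    """Reduce a bond list to a topology multigraph (illustrative sketch).

    Assumption: `edges` is a list of (atom, atom) index pairs describing
    one connected molecule. Returns the edge list of the reduced graph;
    an empty list stands for the treelike (nonscaffold) case.
    """
    edges = list(edges)

    # Step 1: strip side chains by pruning degree-1 nodes until none remain.
    while True:
        deg = degrees(edges)
        leaves = {n for n, d in deg.items() if d == 1}
        if not leaves:
            break
        edges = [(u, v) for u, v in edges
                 if u not in leaves and v not in leaves]

    # Step 2: contract degree-2 nodes that sit between two distinct edges.
    # A node whose degree of 2 comes from a single self-loop is kept, so a
    # lone ring ends up as one node with one loop.
    while True:
        deg = degrees(edges)
        target = None
        for n, d in deg.items():
            if d == 2:
                incident = [e for e in edges if n in e]
                if len(incident) == 2:
                    target = (n, incident)
                    break
        if target is None:
            break
        n, (e1, e2) = target
        a = e1[0] if e1[1] == n else e1[1]  # neighbour along the first edge
        b = e2[0] if e2[1] == n else e2[1]  # neighbour along the second edge
        edges.remove(e1)
        edges.remove(e2)
        edges.append((a, b))  # becomes a loop when a == b

    return edges


def node_degree_counts(edges):
    """Map degree -> number of nodes, e.g. {3: 2} means two 3-nodes."""
    return dict(Counter(degrees(edges).values()))


# Hypothetical inputs: naphthalene-like fused rings and a spiro bicycle.
fused = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 0),
         (4, 6), (6, 7), (7, 8), (8, 9), (9, 5)]
spiro = [(0, 1), (1, 2), (2, 0), (0, 3), (3, 4), (4, 0)]

fused_counts = node_degree_counts(scaffold_topology(fused))
spiro_counts = node_degree_counts(scaffold_topology(spiro))
```

Under these assumptions, the fused bicycle reduces to two 3-nodes joined by three edges, while the spiro bicycle reduces to a single 4-node carrying two loops, which is the distinction the abstract draws between three-way and four-way connections.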
Related resources
Scaffold Topologies. 1. Exhaustive Enumeration up to Eight Rings
Mapping the chemical space of small organic molecules is approached from a theoretical graph theory viewpoint, in an effort to begin the systematic exploration of molecular topologies. We present an algorithm for exhaustive generation of scaffold topologies with up to eight rings and an efficient comparison method for graphs within this class. This method uses the return index, a topological in...
The Molecule Cloud - compact visualization of large collections of molecules
BACKGROUND Analysis and visualization of large collections of molecules is one of the most frequent challenges cheminformatics experts in pharmaceutical industry are facing. Various sophisticated methods are available to perform this task, including clustering, dimensionality reduction or scaffold frequency analysis. In any case, however, viewing and analyzing large tables with molecular struct...
Ligand based lead generation - considering chemical accessibility in rescaffolding approaches via BROOD
In pharmaceutical industry ligand based approaches like scaffold hopping, scaffold decoration and me-too approaches, are used to generate lead structures in discovery projects. We use several tools to generate novel lead structures, such as BROOD [1]. BROOD is a software tool which explores chemical space around query molecules based on shape similarity and electrostatics, and it generates anal...
The controlled release of dexamethasone sodium phosphate from bioactive electrospun PCL/gelatin nanofiber scaffold
In this study, a system of dexamethasone sodium phosphate (DEXP)-loaded chitosan nanoparticles embedded in poly-ε-caprolacton (PCL) and gelatin electrospun nanofiber scaffold was introduced with potential therapeutic application for treatment of the nervous system. Besides anti-inflammatory properties, DEXP act through its glucocorticoid receptors, which are involved in the inhibition of astroc...
Scaffold Hunter: Facilitating Drug Discovery by Visual Analysis of Chemical Space
The search for a new drug to cure a particular disease involves to find a chemical compound that influences a corresponding biological process, e.g., by inhibiting or activating an involved biological target molecule. A potential drug candidate however does not only need to show a sufficient amount of biological activity, but also needs to adhere to additional rules that define the basic limits...
Journal:
- Journal of Chemical Information and Modeling
Volume 48, Issue 7
Publication date: 2008